CBS Genome Atlas Database: a dynamic storage for bioinformatic results and sequence data
Abstract
New bacterial genomes are currently being published on a monthly basis. With the growing amount of genome sequence data, there is a demand for a flexible and easy-to-maintain structure for storing sequence data and results from bioinformatic analysis. More than 150 sequenced bacterial genomes are now available, yet comparisons of properties for taxonomically similar organisms are not readily available to many biologists. In addition to the most basic information, such as AT content, chromosome length, tRNA count and rRNA count, a large number of more complex calculations are needed to perform detailed comparative genomics. DNA structural calculations such as curvature and stacking energy, and DNA compositions such as base skews, oligo skews and repeats at the local and global level, are just a few of the analyses presented on the CBS Genome Atlas Web page. Complex analyses, changing methods and the frequent addition of new models are factors that require a dynamic database layout. Using basic tools like the GNU Make system, csh, Perl and MySQL, we have created a flexible database environment for storing and maintaining such results for a collection of complete microbial genomes. Currently, these results amount to more than 220 pieces of information. The backbone of this solution consists of a program package written in Perl, which enables administrators to synchronize and update the database content. The MySQL database has been connected to the CBS web server via PHP4 to present dynamic web content to users outside the center. This solution is tightly fitted to the existing server infrastructure, and the solutions proposed here can perhaps serve as a template for other research groups to solve database issues.
AVAILABILITY A web-based user interface that is dynamically linked to the Genome Atlas Database can be accessed via www.cbs.dtu.dk/services/GenomeAtlas/.
SUPPLEMENTARY INFORMATION This paper has a supplemental information page which links to the examples presented: www.cbs.dtu.dk/services/GenomeAtlas/suppl/bioinfdatabase.
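As an illustration of the simplest quantities named in the abstract (AT content and base skew), the following Python sketch computes them for a DNA string. It is not the authors' Perl package: the function names, the choice to ignore ambiguous bases, and the sliding-window parameters are assumptions made for this example, and GC skew is used here as one common form of base skew, defined as (G - C) / (G + C).

```python
from collections import Counter


def at_content(seq: str) -> float:
    """Fraction of A and T among the unambiguous bases (A, C, G, T)."""
    counts = Counter(seq.upper())
    total = sum(counts[base] for base in "ACGT")
    if total == 0:
        return 0.0  # assumption: report 0.0 for empty or fully ambiguous input
    return (counts["A"] + counts["T"]) / total


def gc_skew(seq: str) -> float:
    """GC skew, (G - C) / (G + C); 0.0 when the sequence has no G or C."""
    counts = Counter(seq.upper())
    g, c = counts["G"], counts["C"]
    return (g - c) / (g + c) if (g + c) else 0.0


def windowed_gc_skew(seq: str, window: int, step: int) -> list[float]:
    """Local GC skew over sliding windows (hypothetical window/step sizes)."""
    return [
        gc_skew(seq[start:start + window])
        for start in range(0, len(seq) - window + 1, step)
    ]
```

Calling `gc_skew` on a whole chromosome gives a global value, while `windowed_gc_skew` gives the local profile; the abstract mentions both local and global levels but does not specify window sizes, so those are left to the caller.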
Journal:
- Bioinformatics
Volume 20, Issue 18
Pages: -
Publication date: 2004